On the Number of Experiments Sufficient and in the Worst Case Necessary to Identify All Causal Relations Among N Variables
نویسندگان
چکیده
We show that if any number of variables are allowed to be simultaneously and independently randomized in any one experiment, log2(N) + 1 experiments are sufficient and in the worst case necessary to determine the causal relations among N ≥ 2 variables when no latent variables, no sample selection bias and no feedback cycles are present. For all K, 0 < K < 1 2N we provide an upper bound on the number experiments required to determine causal structure when each experiment simultaneously randomizes K variables. For large N , these bounds are significantly lower than the N − 1 bound required when each experiment randomizes at most one variable. For kmax < N 2 , we show that ( N kmax −1)+ N 2kmax log2(kmax) experiments are sufficient and in the worst case necessary. We offer a conjecture as to the minimal number of experiments that are in the worst case sufficient to identify all causal relations among N observed variables that are a subset of the vertices of a DAG.
منابع مشابه
Almost Optimal Intervention Sets for Causal Discovery
We conjecture that the worst case number of experiments necessary and sufficient to discover a causal graph uniquely given its observational Markov equivalence class can be specified as a function of the largest clique in the Markov equivalence class. We provide an algorithm that computes intervention sets that we believe are optimal for the above task. The algorithm builds on insights gained f...
متن کاملA Tight Upper Bound on the Number of Variables for Average-Case k-Clique on Ordered Graphs
A first-order sentence φ defines k-clique in the average-case if limn→∞ PrG=G(n,p) [ G |= φ ⇔ G has a k-clique ] = 1 where G = G(n, p) is the Erdős-Rényi random graph with p = p(n) being the exact threshold such that Pr[G(n, p) has a k-clique] = 1/2. A question of interest is: How many variables are required to define average-case k-clique in first-order logic? Beyond just the usual language of...
متن کاملIdentify Variables Affecting Rural Participation in Cooperatives Forest by-Products (A case study of Ilam city)
The current research is an applied study, which uses field data and avails itself of a descriptive-correlational and causal-correlation design to analyse the relationships between variables. This study was an attempt to identify variables affecting rural participation in cooperatives forest by-products in Ilam. Population of this research is exploiters of forests' by products in Ilam. The numbe...
متن کاملIdentify systematic relationships of effective social parameters on MSW management (Case study: Tehran, urban 22-district)
It seems that success in municipal wastes management has a detectable relationship with citizens' participation and social mechanisms. So, the present study has done for recognition of systematic relations of effective social parameters on municipal solid waste administration. Data have collected in two phase to fill of questionnaires including awareness assessment of MSW office's personnel of ...
متن کاملAutomatic Discovery of Latent Variable Models
Much of our understanding of Nature comes from theories about unobservable entities. Identifying which hidden variables exist given measurements in the observable world is therefore an important step in the process of discovery. Such an enterprise is only possible if the existence of latent factors constrains how the observable world can behave. We do not speak of atoms, genes and antibodies be...
متن کامل